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ABSTRACT 

;The purpose of this study vas to evaluate a practical 
procedure for deltermining the optimum sequencing of pictorial 
classification tasks in science using mean difficulty indices 
generated by a standard sample of grade two students. Acquisition of 
such skills can affect the learner's performance on subsequent 
material and the teacher's efforts in diagnosing specific learner 
problems. It was predicted that: (1) an ascending order of difficulty 
in the sequence vould be more effective than a random order an^ (2) a 
random order of task difficulty would be a more effective sequence 
than a descending order. Fifty-one grade two subjects constituted the 
standard sample. The experimental sample was composed of another 
group of ^3'4 second grade subjects from the same school and from a 
nearby school. Subsequently, the experimental subjects were randomly 
assigned to three treatment groups of ascending, randomized and 
descending orders of task difficulty. The effects of three 
presentation sequences of the science tasks were experimentally 
evaluated in terms of total student performance. The results 
indicated that a highly consistent series of a pictorial 
classification tasks sequenced in an ascending order of difficulty 
was more effective than a random order and that a random order ^fas 



more effective than a descending order of difficulty, (Authc 
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Introduction 

The sequencing of science materials under instructive and testing con- 
ditions represents an area commonly investigated by educational researchers. 
The practical purpose behind such investigations in science education is 
twofold. First, selected science materials that are properly sequenced 
can facilitate the student's efforts in acquiring intellectual skills (Gagne, 
1973). Acquisition of such skills can affect the learner's performance on 
subsequent material. In theory, easier tasks that are followed by more 
difficult tasks of a similar nature allow the learner to draw upon his mem- 
ory and also be reinforced by his successes as he proceeds through a sequence 
of tasks. Secondly, science materials that are properly sequenced can 
facilitate the teacher's efforts in diagnosing specific learner problems. 
The purpose of this study is to evaluate a rather practical procedure for 
determining the optimum sequencing of pictorial classification tasks using 
mean difficulty indices generated by a standard sample of grade two students. 
The effects of three presentation sequences of the science tasks were ex- 
, perimentally evaluated in terms of total student performance. 

It was hypothesized that multiple classification tasks such as those 
generally pictured in modern elementary school science programs are most 
effectively presented in an ascending order of difficulty, as defined by 
statistically obtained mean item difficulty values from a sample of grade 
two subjects. Random ordering was predicted to be less than best, while a 
descending order was expected to constitute the poorest order- Tasks gen- 
erally associated with these multiple classification skills require the 
child to attend to two or more stimulus dimensions (e.g., color, shape) in 
a simultaneous fashion. Children's capacity to perform these tasks have 
been identified as a key ability directly related to the Piagetian jtage of 
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concrete operations (Inhedler and Piaget, 1964), to "non-Piagetian" cognitive 
development theory (Glaser arid Resnick, 1972), and to general non-verbal 
intelligence (Raven, 1956) . It appears logical that learners who encounter 
an ascending order of difficulty rather than a random or a descending se- 
quence should acquire appropriate problem- solving techniques, subordinate^ 
skills and positive anticipations of success that can be expected to fac- 
ilitate performance on subsequent plassif ication tasks of a highly similar 
nature. This explanation could be related to work done n set induction 
and direct transfer effects within learning hierarchies. Interestly, 
psychometric research on test item sequencing does not currently support 
predictions of significant differences among the three orders of items 
sequencing mentioned above. 

Method 

Subjects 

^ Fifty-one grade two subjects from a Calgary (Alberta, Canada elementary 
school constituted the standard sample. The experimental sample was com- 
posed of another 134 second grade subjects from the sarae school and from a 
nearby school. Subjects ranged in age from seven to eight years and came 
from middle class, English speaking families. 

Materials 

Fifty-six multiple classification tasks were developed for this study. 
Each task required subjects to sort pictures of domestic animals on the 
bases of two or more stimulus properties (i.e., position, size, shading 
and size). The selection of the animal classifications were based on fam- 
iliar variations of typical animal groupings found throughout North America. 
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The stimulus material for each ta^ consisted of three different positive 



inatances of a class of animals pictorially displayed in the top half of 
an 8% X 11 inch sheet of paper. These positive instances varied in terms 
of two or more stimulus dimensions as defined in Table 1. In addition, 
three different negative instances and a new positive instance were dis- . 
played in the bottom half of the same paper as illustrated in Figure 1. 
The subject's task was to select the new positive instance from among the 
four instances. The successful subject selected the new positive instance 
from among these, four choices in the bottom row based on the relative im- 
portance of the animal characteristics illustrated in the top row of each 
paper. The logical hierarchical order of animal characteristics was: 
shape, shading, size and portion. For example, the stimulus class shown 
in Figure 1 represents a typical specimen aijd consists of three furry dogs 
with shaded ears and spots of varying size. The relevant stimulus dimensions 
are shape (furry dog) and shading (shaded ears and spots). The irrelevant 
stimulus dimensions are size and position because more than one value of 
each is displayed among the positive instances in the top row of this il- 
lustrated example. More generally, subjects always had to attend to the 
shading and position dimensions in some manner throughout the task series. 



Each subject from the standard sample received 56 randomized tasks in 
an individually sequenced order. Subsequent item analysis of the results 
yielded task difficulty values (i.e., the percentage of subjects who cor- 
rectly performed a task) which were used to rank each task. Preliminary 




TABLE 1 AND FIGURE 1 HERE 



Procedure 
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observatlons in the present study and in a highly similar experiment 
(Friebergs and Tulving, 1961) suggested that children generally begin to 
perform selected classification tasks with little problem after about the 
twentieth item. Therefore, the 56 ranked tasks were placed in one of 20- 
five percentile intervals of mean item difficulty. Subsequently, one 
task was random selected from each cell which was used in the experimental 
phase of this study « 

The inclusion of the experimental subject's response in the data 
analysis was contingent upon his ability to perform two preliminary tasks. 
These plant classification tasks were judged very easy by the experimenters 
and equivalent in nature to the subsequent animal (classification tasks 
except for the content material. It was assumed that subjects who were 
unable to perform these two preceding tasks would be unable to perform the 
subsequent animal classification tasks* The presentation of initially 
"^easier tasks is the requirement prescribed by Ebel (1972) in his recommended 
procedure for sequencing test items. 

The experimental subjects were randomly distributed to three treatment 
groups* The ascending treatment consisted of a task order from the easiest 
to the most difficult item based on the standard subjects* performance on in- 
dividual tasks. The randomized treatment consisted of a random ordering of 
tasks for each subject. The descending treatment consisted of an order 
opposite to the ascending treatment sequence. The standard and experimental 
subjects were instructed to pretend that they were selecting different kinds 
of animals for a children's zoo. They were told to print an "x?' on the 
pictured animal in the bottom row of each paper which was the same kind of 
animal as those three in the top row of the same paper. A sample task using 
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an analogous geometric stimuli task was displayed on a 20 by 26 iach display. 
Subjects were told the correct bhoice by an experimenter. Subsequently, 
subjects were instructed to perform the two plant classification tasks and 
thei^the animal tasks. The experimenters did not encourage subjects to 
hurry through the task series. 

Results 

Seventeen items (the first two and last percentile interval of the 
I mean Item difficulty range resulted in empty cells) of the 56 normative 
tasks were used in the three experimental treatments. Four standardized 
and six experimental subjects failed to perform one or both preliminary 
tasks and were eliminated from the analysis. Two other experimental sub- 
jects were unable to complete the 17 tasks in a reasonable amount of time. 
Item analysis of collected data from this study is shown in Table 2. The 
internal consistency reliability (Cronbach's Alpha) was .96 for the standard 
sample and .72 for the randomized experimental sample of the study. Two 
orthogonal comparisons of the means using analysis of variance supported 
the two a priori hypotheses as shown in Table 3. As predicted, the 
ascending treatment scores were significantly higher (F»14.28, df=l;88, 
p<C«001) than the randomized treatment scores which, in turn, were sig- 
nificantly higher (F=4.68, df«l;87, p<.05) than the descending treatment 
acorea. 

[rabies 2 and 3 Here^ 
Discussion 

These results suggest that a highly consistent series of a pictorial 
classification tasks sequenced in an ascending order of difficulty is more 
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effectlve than a random order and that a random order Is more effective than 
a descending order of difficulty. Consequently, this procedure of sequencing 
science materials represents a more promising practical approach than many 
other commonly suggested methods. Subsequent behavioral analyses of the 
individual tasks and construct validation of the task series should in- 
crease its utility in terms of future science education experiments which 
require multiple classification of consistent, concrete and familiar class- 
room material. The experimental tasks incorporate ^re concrete, familiar 
concepts than the abstract, colored matrices used in the Raven Coloured Pro- 
gressive Matrices Tests (Raven, 1956). The Raven test is the instrument 
typically used in laboratories and schools to assess the multiple class- 
ification ability and the non-verbal intelligence of young children. The 
concrete tasks developed for the present study required children to sort 
pictures of domestic animals on the basis cf two or more stimulus properties 
(i.e., position, shape, size, and shading). The teaching of such concrete 
intellectual skills has direct extra-school relevance and represents the 
type of material that should be taught in the elementary schools according 
to Rohwer (1971). Instructive tasks highly similar to the ones explored 
in the present study, can be found in Science - A Process Approach (See 
A.A.A«S. , 1967), Science Curriculum Improvement Study (See S.C.I.S., 1968) 
and Elementary Science Study (See E.S.S., 1968). 

Previous classification research in science education has been largely 
based in Piagetian theory and has examined such topics as the differential 
effects of varying the response made between the instructive and evaluative 
components (Popp and Raven, 1972)., relative performance by middle and lower 
socioeconomic class children (P^ven, 1968) , and practice effects on simple 
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and complex classification rules (Raven, 1970). Multiple 
classification research more directly relevant to the present study has 
been based on Piagetian and behavioral analysis theory. Information gained 
from these studies have mainly dealt with investigation of hierarchical 
positions and imEortance of this skill (Kofsky, 1966), modification of 
children's ability to perform the skill (Klaismeier and Hoover, 1974) and 
training programs used to facilitate transfer to generalized standardized 
tasks Such as the Raven Colored Progressive Matrices Test (Jacobs and Van- 
devender, 1971a; 1971b). One of the most interesting studies in this area 
was reported by Resnlck, Siegal and Kresh (1971) in which two double clas- 
sification tasks of a similar nature were carefully analyzed and compared. 
Kindergarten subjects learning the easier task first needed fewer trials 
on the first task than those subjects who learned the tasks in the reverse 
order. However, subjects who learned the more complex task first were 
better able to learn the less complex task second. These results were 
consistent with a science education experiment performed by Raven (1970) 
in which 12 classification rules were taught to third grade children. 
Further, discussion will concentrate ©n the experimental results in light 
of work done in psychometry, set induction and learning hierarchies. 

The effects of varying the order (i.e., ascending, random and occasional- 
ly descending) of classroom materials has been explored, in numerous psycho- 
metric studies of achievement power (nonspeeded) tests with a consistent 
result of no significant differences. According to Ebel's (1972) review of 
this topic, the order of material such as that found in school tests does 
not make a significant difference in the student's scores as long as the first 
one or two test items are relatively easy. More specifically, Marso, (1970) 
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X In tvo experiments, found no differences among college subjects using over 
100 vocabulary test items with a wide difficulty range. Furthermore, Sax 
and Cormack (1966) and Munz and Smoure (1968) found no differences between 
sequences. Sirotnik and Wellington (1974) investigated a similar question 
utting content as an index of order with grade 8 subjects responding on 
mathematics, science, social science, reading and language arts achievement 
tests. This was the only study in which reliability coefficients were 
reported. This statistic is of particular importance when evaluating the 
potential effects items or tasks have on one another. . No- ordering effects 
nor significant differences in estimates of examining percentage correct 
score, variance estimates of item difficulty, or K-R 20 estimates of internal 
consistences were found. Furthermore, such similar effects of pictures as 
stimulus material rather than objects or verbal synlbols is unexplored in 
the psychometric literature. Pictorial stimuli have been used In numerous 
other science education experiments (Hd|liday, 1973). However, most of the 
work has involved instructional variables with secondary school science 
subjects (See: Holliday, 1975 in press; Holliday and Harvey, in press; 
Koran, 1973; Wilson, 1973). Therefore, the extent to which the present 
classification experiment is generalizable to other stimulus modes shpuld 
also be investigated at all levels of cognitive development. f 

The differential student performance from the three task orders were 
likely the result of cognitive and affective set and direct transfer effects 
from a possible learning hierarchy. The specific contribution of each of 
these elements to the success of the differential order presentations is cur- 
rently a matter of conjectur.e and not a direct concern of this study. 
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Set induction occurs vhen a learner becomes predisposed to view and 
Approach tasks in a manner affected by previous experiences. The influence 
of a cognitive set can have either a positive effect (See Wittlock, 1963; 
Schuck, 1970, and 1971; and Torrance and Harman, 1961) or a negative ef- 
fect (See Luchins, 1942, and Birch and Rabinowitz, 1951) • Experiments in 
set induction suggest that subjects, performing a series of internally 
consistant tasks, are better able to assimilate and organize the stimulus 
materials (KacDonald, 1965). It is also a well established phenomena that 
previous successful experiences generate increased levels of expectation and 
self-confidence which, in turn, can lead to additional success with sim- 
iliar problems (DeCecco, 1968). Children who anticipate success as a result 
of previous experience seem to increase their chances of success on subsequent 
similar tasks. Two critical ingredients consistent with cognitive and af- 
fective set theory can be found in this study. First, set cannot be induced 
without consistency among problems. Consistency among the intellectual 
tasks in this study was clearly established by the extremely high reliability 
coefficients. Second, different degrees of success were generally experienced 
by the experimental subjects depending upon their assigned treatment. These 
two characteristics of the present study support the importance of set theory 
under conditions of varied sequencing of pictorial classification tasks. 

Findings in this study suggest that mean difficulty values obtained from 
a standard sample pop^jlation constituted an effective procedure for determining 
optimum order of pictorial classification tasks in science and might be 
pertly explained in terms of direct transfer effect. The practical con- 
sequences of such correlational methods relative to the more common pro- 
cedure delineated by Gagne (1970) have "suggestive utility for instructional 
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purposes" (Glaser et al, 1972), but their future Importance is "unclear'* 
At this time (Gagne, 1973). Such research has mainly dealt with rigorous 
validation procedures for determining specif ic ^relationships among in- 
dividual tasks (White, 1974a). White (1974b) has outlined a complex model 
for such a validation procedure. In addition. White (1974c) discussed 
his method in terms of other indexes evaliiated in science education, such 
as the classical work of Capie and Jones (1971). Of course, tne objective 
in the present study was to evaluate a more practical and lesr rigorous 
method of sequencing a given set of tasks and not to evaluate a procedure 
for validating a learning hierarchy as prescribed by Gagne (1970) and 
later by White (1974b) . 

Finally, learning hierarchy research (Gagne, 1973) suggests that the 
sequence of classroom materials is more likely to have practical consequences 
under: three conditions. First, the materials should be associated with intel- 
lectual skills or cognitive strategies (as called by some theorists). 
Second, small units of materials should be evaluated rather than entire 
school programs or curriculums. Third, learners who are unsophisticated 
readers or relatively young are more likely to be influenced by differential 
sequencing of classroom materials. These three elements were contained with- 
in the experimental conditions of the present study, which probably increased 
Che change of direct transfer effects. In conclusion, the three presentation 
orders of difficulty were experimentally shown to have a differential effect 
on total student performance. These results indicated that the way a teacher 
sequences a large number of pictorial' classification tasks in science can 
make a real difference. 
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TABLE 3 

Observed Means and Standard Deviations on the 
EKperimental Group Scores 

Treatment Groups n X S, D. 

Ascending 45 12.47 1.93 

Randoo^.2ed 45 10.51 £.89 

Descending 44 8.89 4.10 
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